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Amendments to the Claims: 

This listing of claims replaces all prior listings of claims: 

Listing of Claims 

1 . (Previously Presented) A method for detecting misrepresentation of policy related 
information provided to an insurer by a policyholder where the information is used by the insurer 
in determining an amount of premium to be paid for insurance coverage provided to the 
policyholder, the method comprising: 

selecting, by one or more computing systems, a plurality of insurance policies to process 
with a predictive model; for each selected policy, deriving variables from policy related 
information provided by the policyholder in connection with the selected policy, the derived 
variables not including claim related information; and 

for each selected policy, applying, by the one or more computing systems, the derived 
variables of the policy to the predictive model to generate a model score indicating the relative 
likelihood of misrepresented information provided by the policyholder or an expected adjustment 
of the premium on the policy. 

2. (Previously Presented) The method of claim 1, further comprising: 
collecting, by one or more computing systems, training data including a plurality of 

insurance policies having misrepresented information and a plurality of policies not having 
misrepresented information; developing , by one or more computing systems, the predictive 
model from the training data; and storing, by one or more computing systems, the predictive 
model. 

3. (Previously Presented) The method of claim 1 further comprising: 
converting, by one or more computing systems, the model score to a fraud score 

indicating a probability of fraud in the policy. 

4. (Previously Presented) The method of claim 1, further comprising: 
converting, by one or more computing systems, the model score to the expected 

adjustment of the premium on the policy. 
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5. (Previously Presented) The method of claim 1 , wherein selecting a plurality of 
insurance policies further comprises: 

for each policy, automatically determining, by one or more computing systems, start and 
end dates of a scoring period over which misrepresented policy information is to be detected. 

6. (Previously Presented) The method of claim 5, wherein the start and end dates of 
the scoring period for which the policy has consistent and complete data. 

7. (Previously Presented) The method of claim 5, further comprising: 
responsive to a policy not having consistent or complete data in the scoring period, 

defining, by one or more computing systems, an exclusion code providing a reason that the 
policy was not selected. 

8. (Previously Presented) The method of claim 5, wherein the insurance policies are 
workers' compensation insurance policies, and automatically determining start and end dates of 
the scoring period further comprises: 

defining, by one or more computing systems, the start and end dates such that all audit 
adjustments are contained between the start and end dates. 

9. (Previously Presented) The method of claim 1, wherein selecting a plurality of 
insurance policies further comprises: 

for each policy, receiving, by one or more compufing systems, a user defined scoring 
period to be scored for the policy; and automatically selecting, by one or more computing 
systems, those policies having consistent and complete data in the respective user defined time 
period from which the variables for the predictive model may be derived. 

10. (Previously Presented) The method of claim 9, further comprising: 
responsive to a policy not having consistent or complete data in the user defined time 

period defining, by one or more computing systems, an exclusion code providing a reason that 
the policy was not selected. 

11. (Previously Presented) The method of claim 9, further comprising: 
responsive to a policy not having consistent or complete data in the user defined scoring 
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period, automatically suggesting, by one or more computing systems, a scoring period in which 
the policy has consistent and complete data. 

12. (Previously Presented) The method of claim 1, wherein deriving variables from 
policy related information further comprises: 

determining, by one or more computing systems, a plurality of peer groups of which the 
selected policy is a member; and for each peer group or set of peer groups of which the selected 
policy is a member, deriving, by one or more computing systems, variables from the policy 
information which attribute characteristics of the peer group or set of peer groups to the selected 
policy, or which compare the selected policy to other policies in the peer group or set of peer 
groups. 

13. (Original) The method of claim 12, wherein the derived variables estimate the 
probability of a dichotomous outcome or a certain distributional statistic of a continuous quantity 
for a policy, based on the peer group(s) of which the policy is a member. 

14. (Previously Presented) The method of claim 12, wherein deriving variables for the 
policy which compare the policy to other policies in its peer group(s) further comprises deriving, 
by one or more computing systems, variables that compare either at least one characteristic e of 
the policy with at least one corresponding characteristic of the policies in its peer group(s). 

15. (Previously Presented) The method of claim 12, further comprising: 

for each of the plurality of peer groups, storing, by one or more computing systems, in a 
lookup table group statistics for policy characteristics of the policies in the peer group; and 

deriving, by one or more computing systems, the variables for a selected policy by 
determining the peer group to which the selected policy belongs and using the statistics for the 
policy characteristics for the peer group to derive the variables for the selected policy. 

16. (Previously Presented) The method of claim 15, further comprising: 
updating, by one or more computing systems, the lookup table for a peer group of the 

selected policy using policy information from the selected policy. 
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17. (Previously Presented) The method of claim 1, wherein deriving variables further 
comprises: 

deriving, by one or more computing systems, variables from the policy information 
which compare the selected policy in a selected time period with the selected policy in a time 
period prior to the selected time period. 

18. (Previously Presented) The method of claim 17, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
selected policy in a time period prior to the selected time period further comprises: 

deriving, by one or more computing systems, variables which quantify an amount or 
distribution of risk-related activities associated with the policy. 

1 9. (Previously Presented ) The method of claim 1 7, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
selected policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, at least one measure which is a 
percentage change in a policy characteristics between the selected time period and the previous 
time period. 

20. (Previously Presented) The method of claim 17, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
selected policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, a vector policy characteristics for the 
selected time period and a vector of the policy characteristics in the prior time period; and 
determining, by one or more computing systems, a scalar measure of comparison between the 
two vectors. 

21 . (Original) The method of claim 20, wherein the scalar measure of comparison 
between the two vectors is computed as either a measure of distance between the two vectors or 
an angle measure between the two vectors. 

22. (Previously Presented) The method of claim 17, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
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selected policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, a percent change in a payroll share in at 
least one employment classification in the selected time period relative to the previous time 
period. 

23. (Previously Presented) The method of claim 17, wherein deriving variables from the 
policy information which compare the selected policy in a selected time period with the selected 
.policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, a percent change in a payroll share in an 
exception group in the selected time period relative to the previous time period. 

24. (Previously Presented) The method of claim 17, wherein deriving variables from the 
policy information which compare the selected policy in a selected time period with the selected 
policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, a vector distance between vectors of 
payroll percent shares in each of a plurality of employment classes in the selected time period 
and in the prior time period. 

25. (Original) The method of claim 24, wherein the employment classes are SIC 
employment classes. 

26. (Original) The method of claim 24, wherein the employment class groups are NCCI 
employment class groups. 

27. (Original) The method of claim 24, wherein the employment class groups are rate- 
driven employment class groups. 

28. (Original) The method of claim 24, wherein the employment class groups are data- 
driven employment class groups, each group including employment classes that are likely to 
appear together in payroll reports. 

29. (Previously Presented) The method of claim 17, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
selected policy in a time period prior to the selected time period further comprises: 
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determining, by one or more computing systems, a percent change in a number of claims 
filed on the policy in the selected time period relative to number of claims filed on the policy in 
the prior time period. 

30. (Previously Presented) The method of claim 17, wherein deriving variables from 
the policy information which compare the selected policy in a selected time period with the 
selected policy in a time period prior to the selected time period further comprises: 

determining, by one or more computing systems, a vector distance between a first vector 
of the number of claims filed in the selected time period for each of a plurality of injury types 
and a second vector of the number of claims filed in the prior time period in each of the plurality 
of injury types. 

3 1 . (Original) The method of claim 1 , wherein the insurance policies are workers' 
compensation insurance policies and the policy relative information from which the variables for 
assessing the policies are derived includes payroll reports for the policyholder. 

32. (Previously Presented) The method of claim 1, further comprising: 
deriving, by one or more computing systems, direct policy variables which measure 

characteristics of the policyholder or the policy itself without comparison to other policies or the 
same policy in a prior time period. 

33. (Original) The method of claim 32, wherein the direct policy variables are selected 
from the group consisting of: 

type of company of the policyholder; 

location of the policyholder; 

number of employees of the policyholder; 

number of policy cancellations; age of the policy; 

industry type of the policyholder; 

amount of payroll reported by the policyholder; and 

distribution of payroll reported by the policyholder with respect to at least one 
employment class. 
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34. (Previously Presented) The method of claim 1, further comprising: 
deriving, by one or more computing systems, direct claim variables which measure 

characteristics of claims filed on policy. 

35. (Original) The method of claim 34, wherein the direct claim variables are selected 
from the group consisting of: 

number of claims filed during the selected time period; 
dollar amount of claims filed during the selected time period; 
type of claims filed during the selected time period; 

number of claims filed during the selected time period relative to amount of premium 
paid during the selected time period; and 

number of claims filed during the selected time period relative to a size of payroll during 
the selected time period. 

36. (Previously Presented) The method of claim 1, further comprising: 

deriving, by one or more computing systems, variables that measure the probability of 
fraud in the policy conditionally based on at least one policy characteristic of the policy. 

37. (Previously Presented) The method of claim 1, further comprising: 
applying, by one or more computing systems, the policy to a plurality of decision rules 

which identify specific inconsistent or suspicious policy facts related to the policy, to generate an 
output indicating which decision rules were violated by the policy. 

38. (Original) The method of claim 37, wherein the decision rules are derived from 
statistical analysis of insurance policies of at least one insurer which have been determined to 
contain misrepresented policy information. 

39. (Original) The method of claim 37, wherein the insurance policies are workers' 
compensation insurance policies and wherein the decision rules are selected from a group 
consisting of: 

a decision rule that identifies as potentially fraudulent a policy that has an employment 
class code on a claim with an injury date during the selected time period but the employment 
class code for the claim is not included in payroll reports for the policy during the selected time 
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period; 

a decision rule that identifies as potentially fraudulent a policy that reports zero payroll 
during the selected time period but for which one or more certificates of insurance were issued 
during the selected time period; 

a decision rule that identifies as potentially fraudulent a policy that reports zero payroll 
during the selected time period but which has at least one claim with an injury date during the 
selected time period; 

a decision rule that identifies as potentially fraudulent a policy with an officer who is 
currently or was selectedly an officer on a different policy and where the new policy has a lower 
experience modification factor than the prior policy; and 

a decision rule that identifies as potentially fraudulent a policy that has an employment 
class code on a claim and for which no premium was reported at the time the claim was opened. 

40. (Previously Presented) The method of claim 1, further comprising: 

for each selected policy, determining, by one or more computing systems, at least one 
variable which significantly contributes to the model score for the policy; and 

outputting, by one or more computing systems, a reason for the model score with respect 
to the determined at least one variable. 

41 . (Original) The method of claim 40, wherein the insurance policies are workers' 
compensation insurance policies, and wherein the significant variable is selected from a group 
consisting of 

an indication of whether the policy has been previously audited; 
an indication of whether a reported payroll has been adjusted; 
a number of employment class codes in at least one payroll report of the policyholder 
during the selected time interval; 

a type of company of the policyholder; 

an age of the policy; 

a size of payroll of the policyholder; 

a size of a premium paid on the policy; 

an industry classification code of the policyholder; 

a distribution of payroll in at least one payroll report of the policyholder during the 
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selected time interval; 

a percent payroll share in a low rated employment class code; 

a change in a distribution of payroll in at least one payroll report of the policy- holder 
during the selected time interval compared with the prior time period; 

a change in an exception group payroll share in at least one payroll report of the 
policyholder during the selected time interval compared with the prior time period; 

a payroll share in a group of agriculture related employment classes; a payroll share in a 
group of construction related employment classes; 

a payroll share in a group of manufacturing related employment classes; a payroll share 
in a group of government related employment classes; 

a payroll share in at least one clerical employment classes; 

a number of prior cancellations of the policy; 

a ratio of the number of claims made on the policy to a size of the payroll of the 
policyholder; and 

a number of claims on the policy during the selected time interval. 

Claims 42-44 (Cancelled) 

45. (Previously Presented) A method of estimating a quantity corresponding to a set of 
entities grouped using one or more hierarchical categories, the method comprising: 

determining, by one or more computing systems, an estimate of the quantity for a first 
category corresponding to the highest level of the hierarchy; and 

for each subsequent category representing a current lower level of the hierarchy, 
adjusting, by one or more computing systems, the estimate of the quantity using an estimate for 
the current level and the estimate of the higher level, the quantity being estimated being a risk 
factor characterizing misrepresentation of policy related information provided to an insurer by a 
policyholder where the information is used by the insurer in determining an amount of premium 
to be paid for insurance coverage provided to the policyholder, wherein adjusting the estimate of 
the quantity comprises applying, by one or more computing systems, a Bayesian adjustment to 
the estimate using the estimate for the current level of the hierarchy and the estimate of the 
quantity from the higher level, the risk factor being to generate variables derived from policy 
related information provided by a policyholder in connection with a corresponding policy, the 



-10- 



Attorney's Docket No.: 35006-577F01US 



generated variables being used by a predictive model to generate a score indicating the likelihood 
of misrepresentation of policy information by the policyholder of the policy. 

46. (Previously Presented) The method of claim 45, wherein each category of the 
hierarchy has a value for the risk factor. 

47. (Original) The method of claim 45, wherein the hierarchy of categories are Standard 
Industry Classification codes (SIC), and the quantity being estimated is risk factor associated 
with each SIC code. 

48. (Canceled). 

49. (Previously Presented) A system for detecting premium fraud in an insurance 
policy, comprising: 

a database of insurance policies, each policy associated with a policyholder and having 
policy related data; and 

a computer system that implements: 

a policy selection process that selects from the database a number of policies for 

scoring; 

a variable derivation process that derives for each of the selected policies 
variables associated with the policyholder of the policy for comparing the policy to peer group 
policies, and variables for comparing the policy in a selected time period with the policy a time 
period prior to the selected time period, the variables being derived from policy related 
information provided by the policyholder in connection with the corresponding policy and not 
including claim related information; and 

a fraud detection module that receives for each policy the derived variables and 
generates a score indicating the likelihood of misrepresentation of policy information by the 
policyholder of the policy. 

50. (Previously Presented) The system of claim 49, wherein the fraud detection module 
further comprises: 

a predictive model that generates a model score indicating a relative likelihood of 
misrepresentation of policy information by the policyholder; and 
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a post scoring process that converts the model score into the fraud score indicating a 
probability of misrepresentation of policy information. 

5 1 . (Original) The system of claim 50, wherein the post scoring process converts the 
model score into an expected adjustment of premium for a policy. 

52. (Previously Presented) The system of claim 50, the computer system furtherv 
implementing: 

a rule-based process that applies a plurality of rules to a selected policy to identify 
policies suspected of premium fraud based on inconsistent or incomplete policy related 
information. 

53. (Previously Presented) A method for determining a usage strategy for processing 
insurance policies suspected of premium fraud, the suspected policies selected from a plurality of 
insurance policies, the method comprising: 

establishing, by one or more computing systems, a frequency for scoring the plurality of 
insurance policies to obtain for each policy a score indicating a relative likelihood of premium 
fraud in the policy, the scoring being based solely on non-claim information; 

establishing, by one or more computing systems, a ranking function for ranking the 
scored policies; and 

establishing, by one or more computing systems, a plurality of threshold scores, and for 
each threshold score, defining an audit action for performing on policies which have a score 
exceeding the threshold score, but not exceeding a next greater threshold score. 

54. (Previously Presented) The method of claim 53, wherein establishing a ranking 
function for ranking the scored policies further comprises: 

ranking, by one or more computing systems, the scored policies according to their scores. 

55. (Previously Presented) The method of claim 53, wherein establishing a ranking 
function for ranking the scored policies further comprises: 

ranking, by one or more computing systems, the scored policies according to an expected 
adjusted premium. 
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56. (Previously Presented) The method of claim 53, wherein establishing a plurality of 
threshold scores further comprises: 

establishing, by one or more computing systems, a first threshold score for selecting for a 
desk audit those policies having a score exceeding the first threshold score; and 

establishing, by one or more computing systems, a second threshold score for selecting 
for a field audit those policies having a. score exceeding the second threshold score, wherein the 
second threshold score is greater than the first threshold score. 

57. (Previously Presented) The method of claim 53, further comprising: 
establishing, by one or more computing systems, a set of rules for identifying policies 

suspected of premium fraud. 

58. (Previously Presented) The method of claim 53, further comprising: 
establishing, by one or more computing systems, a plurality of reason codes, each reason 

code providing an explanation for a policy receiving a score; and 

establishing, by one or more computing systems, for each of a number of reason codes, at 
least one audit action to be taken in response to a policy having a score which produces the 
reason code. 

59. (Previously Presented) A method for processing insurance policies suspected of 
premium fraud, the method comprising; 

scoring, by one or more computing systems, each of a plurality of insurance policies with 
a predictive model to generate for each policy a score indicating a relafive likelihood of premium 
fraud, the score being derived from solely from policy information and not claim information; 

ranking, by one or more computing systems, the scored policies according to the scores; 

selecting, by one or more computing systems, for a desk audit those policies having a 
score exceeding a first threshold score; and 

selecting, by one or more computing systems, for a field audit those policies having a 
score exceeding a second threshold score, wherein the second threshold score is greater than the 
first threshold score. 
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60. (Previously Presented) A method for processing insurance policies suspected of 
premium fraud, the method comprising: 

scoring, by one or more computing systems, each of a plurality of insurance policies 
with a predictive model to generate for each policy a score indicating a relative likelihood of 
premium fraud, the score for each policy being based on measures of characteristics of the 
policy, policyholder, payroll, and non-claim activity related to the policy; 

determining, by one or more computing systems, for each scored policy an expected 
premium adjustment; ranking the scored policies according to their expected premium 
adjustment; selecting for a desk audit those policies having an expected premium adjustment 
exceeding a first threshold amount; and 

selecting, by one or more computing systems, for a field audit those policies having an 
expected premium adjustment exceeding a second threshold amount: wherein the second 
threshold amount is greater than the first threshold amount. 

61 . (Previously Presented) A method of developing a predictive model of insurance 
premium fraud, the method comprising: 

collecting, by one or more computing systems, from at least one insurance company 
policy information for a plurality of insurance policies; 

determining, by one or more compufing systems, for each policy a scoring period for 
scoring the policy; 

selecting a training set of policies; 

deriving, by one or more computing systems, for each policy in the training set a plurality 
of variables from the policy information and from other information relevant to policy premiums, 
the set of derived variables excluding claim activity information; 

applying, by one or more computing systems, the derived variables to an untrained 
predictive model to train the predictive model to produce a measure with respect to whether the 
policies are fraudulent or non-fraudulent during their respecfive scoring periods; 

for each policy in the training set, providing, by one or more computing systems, a 
random value for the previously audited variable, and applying, by one or more computing 
systems, the derived variables and the random value of the previously audited variable to the 
predictive model; and 

for each policy in the training set, providing, by one or more computing systems, actual 
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value for the previously audited variable indicating whether the policy was previously audited 
for the scoring period, and applying, by one or more computing systems, the derived variables 
and the actual value of the previously audited variable to calibrate the scores produced by the 
predictive model; and 

selecting, by one or more computing systems, a subset of the derived variables for the. 
using in the predictive model, which variables significantly contribute to a prediction of whether 
a policy is fraudulent during its scoring period. 

62. (Previously Presented) The method of claim 6 1 , wherein the insurance policies are 
workers' compensation insurance policies, further comprising: 

excluding, by one or more computing systems, from the training set policies for which no 
payroll is reported during the scoring period for the policy. 

63. (Previously Presented) The method of claim 61, further comprising: 

tagging, by one or more computing systems, each of the policies to indicate whether the 
policy is fraudulent, non-fraudulent or indeterminate; and 

excluding, by one or more computing systems, from the training set policies which are 
tagged as indeterminate. 

64. (Canceled). 
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